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ABSTRACT 


It is well known that post-secondary science and engineer- 
ing programs attract fewer female students. In this pa- 
per, we analyze gender differences through text mining of 
over 30,000 applications to the engineering faculty of a 
large North American university. We use syntactic and se- 
mantic analysis methods to highlight differences in motiva- 
tion, interests and background. Our analysis leads to three 
main findings. First, female applicants demonstrate a wider 
breadth of experience, whereas male applicants put a greater 
emphasis on technical depth. Second, more female appli- 
cants demonstrate a greater desire to serve society. Third, 
female applicants are more likely to mention personal influ- 
ences for studying engineering. 


Keywords 
Gender differences, engineering, admissions, text mining, 
clustering. 


1. INTRODUCTION 


The failure of science and engineering programs to attract 
equal numbers of women and men is well-documented; only 
23% of women with high scores in mathematics pursue Sci- 
ence, Technology, Engineering and Mathematics (STEM) 
degrees as compared to 45% of men with the same scores 

. As a result, there has been a great deal of research on 
understanding why this is the case; see, e.g., 
[20]. The major findings of prior work are that women 
are less likely to pursue STEM degrees because they do not 
see how this leads to societal improvement, and that women 
are more often led to study engineering because of influences 
from family and friends. Prior work has also found that the 
gender gap in STEM fields is not due to a difference in tech- 
nical ability. 


One weakness of existing work is that it is based on small 
datasets collected through surveys and longitudinal studies. 
In this paper, we present a large-scale text mining study of 


this topic. Our analysis is enabled by a unique dataset of 
over 30,000 undergraduate applications to the engineering 
faculty of a large North American university. Applicants 
are required to describe why they are interested in study- 
ing engineering, and provide other relevant information such 
as reading interests, extracurricular activities and program- 
ming experience. Our goal is to determine whether female 
applicants identify different reasons for applying to an engi- 
neering program, and whether female applicants have differ- 
ent technical and extracurricular backgrounds. 


To answer these questions, we use text mining to extract 
the reasons why students apply to engineering programs. 
As in other text mining applications, challenges arise due 
to the ambiguity of natural language. To overcome these 
challenges, we rely on word embeddings and clustering to 
partition the text into semantically meaningful groups. We 
also analyze gender differences in programming languages 
and extracurricular activities through classification models 
and word frequency analyses. To the best of our knowledge, 
there is no prior work on large-scale text mining to obtain 
insights about students’ motivation and interests. 


The main findings of this paper are that women differentiate 
themselves through breadth of experience and men differen- 
tiate themselves through technical depth; women more often 
display a desire to serve society; and that women are more 
likely to mention interpersonal relationships when discussing 
their engineering goals. 


The remainder of this paper is organized as follows. Sec- 
tion |2]| summarizes related work; Section |3| discusses our 
dataset and methodology; Section |4] presents our results; 
Section [5] discusses the implications of our findings; and Sec- 
tion [6] concludes the paper with directions for future work. 


2. RELATED WORK 


There are three areas of work on gender differences in STEM. 
First are qualitative studies on small populations of students 
through interviews and surveys. Second are statistical stud- 
ies that use census data or other summary data. Third, there 
are data mining studies on student performance. These 
works span students who are in high school, already en- 
rolled in STEM programs, and who are working in a STEM 
profession. 


First, we discuss qualitative survey-based studies. 
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Diekman et al. studied 360 students from STEM and 
non-STEM fields consisting of 57.5% women. Each partici- 
pant was asked about their mathematics and science expe- 
rience and their perception of the degree to which different 
careers fulfill their personal goals. Participants’ answers re- 
flected that STEM careers impede communal-goal endorse- 
ment, which refers to how much a field enables achieving 
the goal of helping people and society. It was found that 
gender can predict communal-goal endorsement, and that 
communal-goal endorsement can negatively predict interest 
in STEM and positively predict interest in female-dominated 
programs with higher accuracy than other metrics such as 
gender or self-efficacy. Eccles found similar results on 
a larger, more comprehensive dataset. They presented a 
longitudinal study of 1500 participants from south eastern 
Michigan from 6th grade to adulthood. They found that the 
main source of gender differences in entry to STEM careers 
is not gender differences in mathematical ability, but differ- 
ences in inclinations towards society-oriented jobs. Women 
who aspire to math-related or engineering careers place a 
lower value on society-oriented job characteristics than their 
female colleagues who did not aspire to STEM careers. 


Matusovich et al. examined gender differences in val- 
ues, but only within engineering. The study was conducted 
on 6 women and 5 men who majored in engineering. Each 
student was interviewed once a year throughout their under- 
graduate degree, and asked how his or her values affect their 
decision to earn an engineering degree. Values were classi- 
fied under 4 groups: Attainment (ability to see oneself as an 
engineer), Cost (time and effort involved in their studies), 
Interest (enjoyment of understanding how math and science 
can be applied to every day life), and Utility (potential for 
future earnings). It was found that women were less likely 
to see themselves as engineers but continued to pursue an 
engineering degree due to the other values. 


More reasons to pursue engineering were observed by 
Smith [19]. Smith interviewed 17 women who were studying 
engineering at four different colleges in the United States. 
Smith observed that participants were influenced to study 
engineering by family or friends. These influences played 
a pivotal role in helping the women build self confidence 
in their mathematical and science ability. They found an 
expression of “love” towards mathematics in many cases, de- 
spite the fact that these courses were also considered diffi- 
cult. An interest in physics was found to be instrumental in 
their decision to study engineering. Women chose engineer- 
ing because it allowed them to utilize the concepts covered 
in physics without having to major in physics. However, 
gender differences were not considered. 


In terms of quantitative studies based on summary statis- 
tics, Hango (9| found that while mathematical ability plays 
a role, it does not explain gender differences in STEM ca- 
reer choices. Women with high mathematical ability are 
less likely to enter STEM fields than even men with a lower 
mathematical ability. He also supported the findings of Ec- 
cles suggesting that the gender gap in STEM programs is 
due to other factors. 


There is prior work on gender differences in STEM using 


data mining techniques [12]. However, these find- 


ings focus on student performance, whereas our work focuses 
on students’ motivations for studying STEM, and their non- 
academic experiences and backgrounds. 


Finally, there exists work on gender differences in comput- 
ing, but it focuses on attitudes toward computing and pro- 
ficiency with basic tasks [18]. Instead, we focus on 
reported programming language knowledge. 


To the best of our knowledge, our work is the first one that 
conducts a data driven analysis of the reasons why students 
want to pursue engineering, and calculates the gender dif- 
ferences in these reasons. We also study past employment 
experiences, and programming knowledge in an effort to cap- 
ture a more holistic view of the personalities of women and 
men who apply to engineering. In our conclusions, we verify 
some of the results of previous studies, and add to others. 


3. DATA AND METHODOLOGY 
3.1 Data 


Our dataset comes from the engineering faculty of a large 
North American university. It contains all applications — 
both accepted and rejected — to the 14 available engineering 
programs from 2013 to 2016 inclusive. Table }1] shows the 
number of applications and the gender distribution of the 
applicants to each program, sorted by percentage of female 
students. The dataset includes gender, first choice program, 
and short free text responses to the following fields: 


1. Engineering interests and goals: explain why you are 
interested in engineering and the specific program to 
which you applied. 


2. Reading interests: discuss a book or an article you 
enjoyed or that has had an impact on you (preferably 
something that was not part of a course at school). 


3. List any extracurricular activities or areas of signifi- 
cant interest. 


4. List any jobs you held throughout high school. 


5. Only mandatory for applicants to Software Engineer- 
ing: list any programming experience you have. 


6. Additional information: tell us anything else about 
yourself that you would like us to know when we review 
your application. 


We report results for three groups of applicants: Biomedical 
and Environmental Engineering (BEE), Software Engineer- 
ing (SE) and all other programs (OTHER). We initially an- 
alyzed applications to each program separately but observed 
applicants to programs within OTHER to be similar in the 
trends they display. Notably, the gender split in BEE is eq- 
uitable, unlike other programs which are male-dominated. 
Furthermore, SE has unique application requirements (pro- 
gramming knowledge) and requires additional analysis. 
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Table 1: Gender breakdown by program 


Table 2: Families of programming languages 


Program Applicants % Women % Men Family Constituent Programming Languages 
Environmental 1021 53% 47% Java java, bluej, jython, android 
Biomedical 2015 52% 48% C++ c++, beta 
Chemical 3612 38% 62% Python python 
System Design 957 38% 62% HTML/CSS _ html, html5, css, css3 
Management 1040 36% 64% C C, objective-C, robote : 
Civil 3375 28% 72% JavaScript javascript, jscript, jquery, angularjs 
Geological 361 25% 75% Turing turing, touring 
Nanotechnology 1670 24% 76% C# cH, visual c# 
Electrical 3782 17% 83% Php php 
Computer 3931 16% 84% SQL sql, pl/sql 
Software 3635 14% 86% Other cnet, ada, alice, applescript, bash, etc 
Mechanical 5473 12% 88% 
Mechatronics 2886 12% 88% 
Total 33758 23% 77% 3.2.2 Analysis of Programming Experience 


3.2. Methodology 


We use syntactic and semantic methods to analyze the free 
text responses. Syntactic methods identify words mentioned 
by more men or women, or words that can predict gender. 
Additionally, we apply semantic methods to “Engineering 
Interests and Goals” to capture context and extract the rea- 
sons why men and women want to study engineering. 


3.2.1 Syntactic Analysis 

For each of the six free text fields, we first perform standard 
pre-processing: we remove stop words, tokenize the text 
and stem the tokens using the NLTK snowball stemmey| 
We then perform two syntactic analyses on each field: 


Document Frequencies: we identify words used at least 
once by a larger fraction of men or women (where each re- 
sponse is considered a document). We only report statis- 
tically significant differences with a P-value of 0.05 using a 
proportion test (6). 


Gender Prediction: we build classifiers to predict gen- 
der based on the words or contiguous sequences of words 
(bigrams and trigrams) appearing in a free text response. 
Following previous work on text classification, we use logis- 
tic regression |8] where the dependent variable is gender, and 
the explanatory variables correspond to the possible words 
(or word bigrams/trigrams), and their values correspond to 
their TF-IDF scores [21]. To calculate a TF-IDF score 
for a given word and a given response, we divide the num- 
ber of times the word appears in the response by the Inverse 
Document Frequency - the fraction of responses in the entire 
dataset containing this word. TF-IDF is a useful measure 
because it balances the uniqueness of a term in the corpus 
and the importance of the term to the specific document. 
For each free text field except programming experience, we 
report the F-measure, which is the weighted harmonic mean 
of precision and recall I, and accuracy, both calculated us- 
ing 10-fold cross validation. We use oversampling for SE 
and OTHER to control for gender imbalance; otherwise, a 
classifier that always predicts gender as “male” would have 
a high accuracy on any male-dominated dataset. 


‘http://www.nltk.org/_modules/nltk/stem/snowball html 


In the “Programming Experience” field, SE applicants are 
asked to list their programming experience. The structure 
of this question elicits not only specific programming lan- 
guages, but also encourages applicants to share details about 
their programming experience. Thus, in addition to the doc- 
ument frequency analysis mentioned earlier, we perform the 
following detailed analyses: 


e Programming Language analysis: we calculate the 
number of responses that mention a given program- 
ming language. We start with a list of known lan- 
guages from Wikipediq?| We then add common mis- 
spellings of these languages, and we group them into 
families in consultation with a domain expert. Table] 
shows the language families whose frequencies we will 
report. 


e Programming Concept analysis: we compile a list of 
computing concepts, a sample of which is shown in 
Table[3] group them into categories, and calculate the 
number of responses that mention a given concept. 


e Learning Method analysis: we compile a list of on- 
line programming courses, and common variations of 
“high school”, “self taught”, “higher education”, and 
“employment”. We then categorize these terms accord- 
ing to how an applicant learned programming: “on- 
line”, “high school”, “self taught”, “higher education”, 
“work”, and “other”. Finally, we calculate the number 


of responses that mentioned each learning method. 


e Experience analysis: we extract the amount of expe- 
rience reported by an applicant by searching for the 
words “hour”, “day”, “month”, “year”, as well as com- 
mon abbreviations and misspellings of these words. 
We use the token immediately preceding these words 
to determine the length of time. We convert all of the 


times into months. 


3.2.3 Semantic Analysis of Engineering Interests 

Using the responses to “Engineering Interests and Goals”, 
we want to identify the reasons why students apply to en- 
gineering programs. However, reasons cannot be inferred 


“https: / /en.wikipedia.org/wiki/List_of_programming languages 
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Table 3: Sample of programming concepts 


Concept Category Constituent Concepts 


Basic array, list, loop, if-statement 

Data Structures stack, queue, linked list 

Sorting merge sort, bubble sort, quick sort 
Searching linear, binary, breadth first searches 
OOP object, class, abstraction, encapsulation 
Data Science machine learning, NLP 

Other storage, memory management 


Table 4: Nine questions used with the QA API 


Question No. Question Variant 


Table 5: Sentences extracted from a particular response 
using all 9 question variants 


Question No. Answer produced by the QA API 


1 future entrepreneurship ventures 

2 designing & building complicated solutions 
3 future entrepreneurship ventures 

4 intellectual curiosity and satisfaction 


is core to my personality 


5 i think i fit in well in the tight culture of 
the engineering class 


6 intellectual curiosity and satisfaction 
is core to my personality 


Why are you interested in Engineering? 

What inspired you to study Engineering? 

What do you find inspiring about Engineering? 
What are the reasons you like Engineering? 

Why do you feel the need to pursue Engineering? 
Why are you passionate about Engineering? 
Why does Engineering interest you? 

Why do you want to study Engineering? 

Why do you like Engineering? 


COBNATKWNHH 


simply by counting occurrences of certain keywords; for ex- 
ample, family influence may be expressed by using words 
such as “father”, “mother”, “uncle”, or simply, “family”. Fur- 
thermore, an applicant may mention things other than the 
exact reason as to why they are interested in engineering 
in their response. Our semantic approach deals with these 
issues through the use of Question Answering to isolate top- 
ics being mentioned that could be considered indicative of 
reasons, followed by Clustering using Word Embeddings to 
analyze these. Figure }1} shows the steps in our semantic 
analysis, and they are explained in detail below. 


1. Question Answering (QA): Here, we extract sen- 
tences that are most likely to contain the topics indicative 
of the applicants’ underlying reasons for applying to engi- 
neering. We use a state of the art QA network which 
is available as an open source APF} Given a question and a 
text document, this QA API extracts a sentence that may 
answer the question. However, we discovered that while 
asking the question that directly appeared on the entrance 
application - why are you interested in engineering - yielded 
some relevant sentences, there were additional relevant sen- 
tences that were not identified. To address this problem, we 
consulted with domain experts at the institution and for- 
mulated additional variants of this question. Depending on 
the applicant, not every variant identified a unique sentence. 
Overall, we observed that the number of unique sentences 
extracted per applicant plateaued at nine question variants. 
Table[J]lists the nine variants we use and Table[5]gives an ex- 
ample of the sentences extracted from a particular response 
using each question. 


2. Stop Word Removal: Next, we remove stop words 
from the sentences extracted in the previous step because 
these do not contain any meaningful information about the 
underlying reasons. Similarly, we remove words excessively 


3https: / /github.com /allenai/bi-att-flow 


8 intellectual curiosity and satisfaction 
is core to my personality 


know people much closer 


Ko) 


it’s the best program available 


used by both genders such as “engineering” and the name 
of the university. This step happens after QA because QA 
requires the complete text, stop words included, as input. 


3. Sentence Vector Computation: At this point, each 
response has produced up to nine relevant sentences. We 
use word embeddings to capture semantic proximity between 
sentences. Specifically, we use the word2vec model [14], 
trained on the Google news corpug'| to convert each word 
into a 300-dimensional vector that encodes the underlying 
semantics. We then use the average of all word vectors in a 
sentence as its sentence vector. If two sentence vectors are 
close, the sentences are also semantically similar [1]. 


4. Clustering of Sentence Vectors: Next, we cluster 
the sentence vectors received from the previous step using 
K-Means clustering with Euclidean distance as the similar- 
ity metric and kK = 200, where K is the number of clus- 
ters (the rationale behind this choice of K will be discussed 
shortly). The clusters converge around similar topics. For 
example, sentences containing words related to family such 
as “brother”, “father”, or “sister” have similar word vectors 
and are more likely to be assigned to the same cluster. Note 
that this would not be the case had we clustered the sen- 
tences themselves according to their syntactic similarity. 


5. Cluster Representative Extraction: After comput- 
ing clusters of sentence vectors, we extract representative 
words from each cluster to identify the topic of that clus- 
ter. First, we map sentence vectors back to the original sen- 
tences, which creates 200 sets of sentences, one set for each 
cluster. We then tokenize and stem the text in each set, as 
described in Section The word2vec model consumes 
unstemmed words, compelling us to postpone tokenization 
and stemming until this step. The trigrams in each set are 
ranked using their TF-IDF scores calculated considering all 
200 sets as the corpus. Finally, we represent each cluster 
with a list of 10 top ranking trigrams, an example of which 
is shown in Table [6] 


“https: / /code.google.com/archive/p/word2vec/ 
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Merging 


Clustering of Sentences into Topics 


Figure 1: Semantic analysis methodology 


Table 6: An example of ten trigrams representing a cluster 


mw 
2 
3 
an 


Trigram 


solv problem solv 
problem solv problem 
enjoy problem solv 
problem solv enjoy 
enjoy solv problem 
love solv problem 
problem solv love 
love problem solv 
problem problem solv 
solv problem problem 


COONMDOUEWNHH 


BH 


Table 7: Examples of a mixed cluster using kK = 50 and its 
pure equivalent using K = 200 


Mixed Cluster (50 Clusters) Pure Cluster (200 Clusters) 


kid work day kid work day 
apart tri togeth young age father 
love thing apart visit construct site 
countless hour spent watch father work 
use everi day older brother mechan 
pay close attent dad electr took 
decid high school work day dad 
work day saw like help father 
day day basi expos young age 
year high school uncl civil engin 


Choice of K: In Step 4, we experimented with values of 
ranging from 50 to 200. When choosing a small K and pro- 
ceeding to Step 5 with fewer clusters, many clusters were 
represented by trigrams that were not semantically similar 
enough to warrant being in the same cluster, and some un- 
common trigrams were overpowered by extremely common 
ones. Thus, some nuanced topics were lost as they could 
not form a cluster of their own. Larger values of kK resulted 
in the splitting out of semantically similar topics. These re- 
sulted in pure but redundant clusters, i.e., several clusters 
representing the same topic. For instance, Table |’7| shows 
a cluster of mixed topics on the left when K is 50, and a 
rather pure cluster on the right when K is 200. A bigger K 
made it possible for topics like “kid work day” to be grouped 


with similar semantic contexts like “watch father work”. The 
topics on the right consistently speak of the influence of a 
family member, indicative of family influence as a reason 
for engineering, whereas no single reason can be deduced 
from the cluster on the left. The first AK value that pro- 
duced adequately pure clusters was 200. Thus, the decision 
was made to stop testing larger values and creating further 
unnecessary redundancy. To eliminate the unnecessary re- 
dundancies at K = 200, the clusters were merged in Step 
6. 


6. Cluster Merging: At this point we have 200 clusters of 
sentences, where each cluster is represented by the 10 high- 
est ranking trigrams. To make the clusters interpretable 
and to group them under more general topics, we manually 
merge similar clusters based on their 10-trigram representa- 
tions to produce ten final clusters. This process of merging 
follows the Card-sorting approach. Card-sorting has been 
widely used to systematically derive taxonomies from data, 
to reach a higher level of abstraction, and identify common 
themes [22]. For instance, it can be used to sort responses 
to an open-ended question into bins to deduce themes over 
the responses. We perform card-sorting on the representa- 
tive trigrams, then we brand each of the ten final themes 
with human interpretable labels and consider these our final 
topics. In this process, a number of small clusters whose 
representatives were vague were disregarded, but even then, 
99.5% of applicants were labelled with at least one topic. 
Table|8]shows two examples of representatives of vague clus- 
ters. Since the QA in Step 1 used questions probing the 
reasons why the applicant was applying to engineering, our 
topics can be considered indicative of the same. 


Table [9] shows the final set of topics along with sample tri- 
gram representations of clusters that were classified under 
each topic. Technical Interests refers to characteristics in- 
herent to engineering along with topics related to specific 
engineering disciplines. For instance, the trigram “water 
treatment plant” in Table [9| is part of Technical Interests 
while being specifically related to Environmental Engineer- 
ing. 


All the sentences classified under a specific topic in Table 
[9] are tracked back to the applicants who mentioned them. 
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Table 8: Examples of discarded vague clusters 


Example #1 


Example #2 


appli program appli 
appli chemic program 
program appli appli 
program program appli 
program appli program 
program appli electr 
appli mechan program 
mechan program appli 
appli electr program 
program appli chemic 


pursu decid pursu 
experi inspir pursu 
pursu motiv pursu 
pursu passion believ 
pursu wish pursu 
encourag pursu pursu 
hope continu pursu 
encourag pursu believ 
passion inspir pursu 
desir pursu educ 


Table 9: The final set of ten topics, with representative 
word trigrams of the clusters classified under each topic 


Reason Trigrams (stemmed) 
Famil follow footstep father 
aay. older brother mechan 


Contribution to Society 


improv peopl live 
make world better 
make contribut societi 


Outreach 


attend open hous 
talk student professor 


The statistics presented in the next section are based on 
the number of applicants who mention a given topic, and 
hence indicate the same underlying reason for their interest 
in engineering 


4. RESULTS 

We now describe our results, treating applicants to BEE, 
SE and OTHER separately, as mentioned in Section 
Section presents syntactic (word frequencies and logistic 
Section|tA and semantic (question answering & clustering) 
results for “Engineering Interests and Goals”. Section 
describes the detailed analyses of programming experience 
(only for applicants to SE). The remaining sections discuss 
the results of frequency analysis and logistic regression for 
the remaining fields: job titles, reading interests, extracur- 
ricular activities, and additional information. 


4.1 Engineering Interests and Goals 
4.1.1 Syntactic Analysis 


Document Frequencies: Overall, there are more terms 
that are used predominantly by women, indicating that 
women use a wider variety of terms. We see more women 
using non technical terms to express themselves, and men 
using more technical terms. 


In BEE, more men mention “mechanical” (11.5% of men vs. 
8.2% of women), and “compute” (8.5% of men vs. 5.3% of 
women. More women mention “health” (16.4% of women vs. 
10.6% of men), “improve” (23.6% of women vs. 18% of men), 
“love” (24.8% of women vs. 20.5% of men), and “research” 
(20.6% of women vs. 16.5% of men). 


In SE, more men mention “system” (14.2% of men vs. 9.6% 
of women), “problem” (25.5% of men vs. 20.9% of women), 
“game” (19.1% of men vs. 14.9% of women), and “goal” 
(25.7% of men vs. 21.5% of women). More women mention 
“science” (49.9% of women vs. 43.0% of men), “research” 
(11.0% of women vs. 6.9% of men), “challenge” (18.6% of 
women vs. 14.7% of men), and “people” (20% of women vs. 
16.3% of men). 


In the OTHER group of engineering programs, more men 
mention “mechanical” (28.9% of men vs. 7.5% of women), 
“compute” (25.8% of men vs. 17.2% of women), “robot” 
(16.2% of men vs. 9.9% of women),“car” (9.9% of men vs. 
4.1% of women), and “goal” (24.3% of men vs. 19.4% of 
women). More women mention “chemical” (21.9% of women 
vs. 10.7% of men), “science” (41% of women vs. 32.6% 


creat new technolog 
water treatment plant 
use dismantl toy 
develop medic technolog 


Technical Interests 


math physic chemistri 


Love of Science : 
S love math scienc 


book watch video 
robot competit team 
particip extracurricular activ 


Extracurriculars 


leadership communic skill 


Prior Accomplishments profici skill mathemat 


talk physic teacher 
high school student 


pursu graduat studi 
job opportun engin 
futur career goal 


High School 


Professional Development 


began young age 


Childhood Dream dream childhood dream 


Table 10: F-Measure/Accuracies for predicting gender 
using Engineering Interests & Goals (in %) 


Group Unigram Bigram Trigram 
BEE 60/60.7 60/59.1 57/58 
OTHER 72/78.8 76/80.4 80/77.3 
SE 88/86 98/97.2 94/94 


of men), “creative” (16.1% of women vs. 10.2% of men), 
“study” (30.7% of women vs. 25.3% of men), and “love” 
(24.2% of women vs. 19.4% of men). 


Logistic Regression: Table [10] shows the results for pre- 
dicting gender using words from responses to “Engineering 
Interests and Goals”. The predictive power of logistic re- 
gression decreases with increasing gender balance within a 
group, even after oversampling to compensate for the initial 
gender imbalance. In other words, in programs with an even 
gender split, it is more difficult to guess the gender. 


4.1.2 Semantic Analysis 

We classified the sentences extracted from students’ re- 
sponses under one of ten topics shown in Table [9] Table 
shows the percentage of applicants to BEE who men- 
tioned each topic. The most common topics are Technical 
Interests and Love of Science. More women mention Love 
of Science, which is statistically significant with a P-value 
of 0.03. No other topic had a statistically significant gen- 
der difference. On average, female students in this group 
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Table 11: BEE applicants’ topics 


Topic %All %Women %Men P-value 


Table 13: OTHER applicants’ topics 


Topic %All %Women %Men P-value 


Family 10.9% 11.3% 10.5% 0.47 

Contribution to Society 20.7% 20.5% 20.9% 0.77 
Outreach 8.5% 9.3% 7.7% 0.12 

Technical Interests 86.8% 86.8% 86.8% 0.97 

Love of Science 32.3% 34.1% 30.4% 0.03 
Extracurriculars 5.7% 5.6% 5.9% 0.69 

Prior Accomplishments 6.1% 5.9% 6.3% 0.61 
High School 8.8% 8.4% 9.2% 0.46 

Professional Development 25.3% 25.9% 24.7% 0.47 
Childhood Dream 2.5% 2.7% 2.2% 0.32 


Table 12: SE applicants’ topics 


Topic %All %Women %Men P-value 


Family 12.3% 13.0% 12.1% 0.064 

Contribution to Society 14.7% 16.1% 14.3% 0.00 
Outreach 8.1% 9.9% 7.6% 0.00 

Technical Interests 88.4% 89.0% 88.3% 0.149 

Love of Science22.7% 26.6% 21.7% 0.00 
Extracurriculars 9.0% 7.8% 9.3% 0.00 

Prior Accomplishments 6.6% 7.0% 6.5% 0.18 
High School 10.3% 9.8% 10.5% 0.13 

Professional Development 26.6% 27.5% 26.3% 0.07 
Childhood Dream 3.7% 3.0% 3.9% 0.00 


Table 14: Female students’ topics across all groups 


Topic %SE % BEE % OTHER 


Family 7.6% 11.0% 7.0% 0.00 

Contribution to Society 12.1% 12.5% 12.0% 0.77 
Outreach 8.7% 9.1% 86% 0.703 

Technical Interests 92.6% 92.3% 92.6% 0.77 

Love of Science 13.9% 16.6% 13.4% 0.05 
Extracurriculars 9.2% 10.9% 9.0% 0.17 

Prior Accomplishments 6.1% 7.1% 5.9% 0.30 
High School 11.0% 12.9% 10.7% 0.15 

Professional Development 25.0% 27.7% 24.6% 0.13 
Childhood Dream 2.7% 2.8% 2.6% 0.87 


mention 2.12 topics whereas male students mention 2.05, a 
statistically insignificant difference with a P-value of 0.06. 


Table[12|shows the percentage of applicants to SE who men- 
tioned each reason. The most common reasons are Techni- 
cal Interests and Professional Development. Women men- 
tion Family more frequently than men, which is statisti- 
cally significant with a P-value of 0.00. No other reason had 
a Statistically significant gender difference. On average, fe- 
male students in this program mention 2.04 reasons whereas 
male students mention 1.87, a statistically significant differ- 
ence with a P-value of 0.00. 


Table[13]shows the percentage of applicants to OTHER en- 
gineering programs who mentioned each topic. The most 
common topics are Technical Interests and Professional De- 
velopment. Female students mention Contribution to So- 
ciety, Outreach, and Love of Science more than male 
students, which is statistically significant with a P-value of 
0.00. Male students mention Extracurriculars and Child- 
hood Dream more than female students, which is statis- 
tically significant with a P-value of 0.00. No other topic 
had a statistically significant gender difference. On average, 
female students in this group mention 2.1 topics whereas 
male students mention 2.0 reasons, a statistically significant 
difference with a P-value of 0.00. 


Table[I4] highlights the differences between women who ap- 
plied to BEE vs. women who applied to SE vs. women who 
applied to OTHER programs. The bold values show per- 
centage differences from the other two groups that are sta- 
tistically significant with a P-value of less than 0.05. Female 
applicants to SE, BEE, and OTHER programs differ from 
each other in their mentions of Contribution to Society, 


Family 111% 113% 13.0% 
Contribution to Society 12.5% 20.5% 16.1% 
Outreach 9.1% 9.3% 9.9% 


Technical Interests 92.3% 86.8% 89.0% 

Love of Science 16.6% 34.1% 26.6% 
Extracurriculars 10.9% 5.6% 7.8% 

Prior Accomplishments 7.1% 5.9% 7.0% 
High School 12.9% 8.4% 9.8% 

Professional Development 27.7% 25.9% 27.5% 
Childhood Dream 2.8% 2.7% 3.0% 


Technical Interests, Love of Science, and Extracur- 
riculars with a P-value of less than 0.05. Mentions of High 
School are only different in SE applicants compared to other 
groups, which is statistically significant with a P-value of 
less than 0.05. No other topic had a statistically significant 
difference. 


Table[15] highlights the differences between men who applied 
to BEE vs. men who applied to SE vs. men who applied to 
OTHER. The bold values show percentage differences from 
the other two groups that are statistically significant with a 
P-value of less than 0.05. Male applicants to SE, BEE, and 
OTHER programs differ from each other in their mentions 
of Contribution to Society and Love of Science with a 
P-value of less than 0.05. Mentions of Family and Techni- 
cal Interests are only different for SE applicants compared 
to applicants to other programs, which is statistically sig- 
nificant with a P-value of less than 0.05. Mentions of Ex- 
tracurriculars are different for BEE applicants compared 
to applicants to other program groups, which is statistically 
significant with a P-value of less than 0.05. No other topic 
had a statistically significant difference. 


4.2 Reading Interests 

Document Frequencies: Overall, men tend to report 
reading technical content such as research papers and women 
report reading novels and writing that has a societal focus. 
Words that are predominantly used by men include “arti- 
cle” (17.6% of men vs. 13.4% of women), “enjoy” (29.5% 
of men vs. 25.6% of women), “compute” (5.6% of men vs. 
2.2% of women), and “science” (12.3% of men vs. 10.3% 
of women). Words that are predominantly used by women 
include “love” (20.3% of women vs. 12.6% of men), “novel” 
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Table 15: Male students’ topics across all groups 


Topic %SE % BEE % OTHER 


Family 7.0% 10.5% 12.1% 

Contribution to Society 12.0% 20.9% 14.3% 
Outreach 8.6% 7.7% 7.6% 

Technical Interests 92.6% 86.8% 88.3% 

Love of Science 13.4% 30.4% 21.7% 
Extracurriculars 9.0% 5.9% 9.3% 

Prior Accomplishments 5.9% 6.3% 6.5% 
High School 10.7% 9.2% 10.5% 

Professional Development 24.6% 24.7% 26.3% 
Childhood Dream 2.7% 2.2% 3.9% 


Table 16: F-Measures/Accuracies for predicting gender 
using words from Reading Interests (in %) 


Group Unigram Bigram Trigram 
BEE 64/63 62/60 60/54.2 
OTHER 79/77.8 93/89.8 95/91.8 
SE 92/88.9 96/95.6 93/91.8 


(31.2% of women vs. 24.6% of men), “character” (20.3% of 
women vs. 15.2% of men), “women” (6.1% of women vs. 
1.1% of men), “people” (29.1% of women vs. 24.9% of men), 
and “family” (10.7% of women vs. 6.8% of men). 


Logistic Regression: The results for predicting gender 
based on Reading Interests are shown in Table[16] As before, 
the predictive power of logistic regression decreases with in- 
creasing gender balance within the group. 


4.3 Extracurricular Activities 

Document Frequencies: Overall, male applicants’ ex- 
tracurricular activities have a technical focus, and female 
applicants have a wide breadth of experiences ranging from 
leadership to artistic pursuits. 


In BEE, more men mention “robot” (7% of men vs. 3.6% 
of women) and “coach” (7.1% of men vs. 4.8% of women). 
More women mention “dance” (8.7% of women vs. 1.7% of 
men), “art” (11.3% of women vs. 6.9% of men), “council” 
(21.5% of women vs. 15.6% of men), and “lead” (21.1% of 
women vs. 16.8% of men). 


In SE, more men mention “compute” (20.9% of men vs. 
13.7% of women). More women mention “art” (14.5% of 
women vs. 4.8% of men), “council” (20.5% of women vs. 
11.9% of men), “dance” (8.3% of women vs. 2.2% of men), 
and “lead” (18.7% of women vs. 14.3% of men). 


In the OTHER group of engineering programs, more men 
mention “robot” (11.1% of men vs. 6.3% of women), “com- 
pute” (5.8% of men vs. 2.4% of women). More women men- 
tion “dance” (10.7% of women vs. 2.1% of men), “council” 
(20% of women vs. 12.1% of men), “art” (11.9% of women 
vs. 4.8% of men), “volunteer” (22.9% of women vs. 16.3% 


of men), and “lead” (19% of women vs. 13.1% of men). 


Logistic Regression: The results for predicting gender 
based on Extracurricular Activities are shown in Table [17] 


Table 17: F-Measures/Accuracies for predicting gender 
using words from Extracurricular Activities (in %) 


Group Unigram Bigram Trigram 
BEE 72/72.9 69/66.6 62/59.5 
OTHER 81/81.1 80/77.8 78/71.4 
SE 85/83.3 85/82 94/93.4 


Table 18: F-Measures/Accuracies for predicting gender 
using words from Job Titles (in %) 


Group Unigram Bigram Trigram 
BEE 59/57.9 58/52.6 63/51 
OTHER 65/61.5 64/59.1 67/61.9 
SE 67/63.7 66/58.7 68/51.7 


The predictive power of logistic regression decreases with 
increasing gender balance within the group. 


4.4 Job Titles 

Document Frequencies: Across all programs, men are 
more likely to mention terms that imply technical work or 
manual labour, whereas women are more likely to mention 
terms that imply customer service and caring professions. 
Example words in job titles from male applicants include 
“referee” (4.1% of men vs. 2% of women), “labor” (2.6% of 
men vs. 0.5% of women), and “technician” (3.1% of men vs. 
1.2% of women). Example words in job titles from female 
applicants include “cashier” (12.8% of women vs. 6.8% of 
men), “teacher” (6.2% of women vs. 2.7% of men), and 
“assist” (17.6% of women vs. 14.3% of men). 


Logistic Regression: As shown by the logistic regression 
scores in Table [18] Job Titles do not provide as much pre- 
dictive power as other fields. 


4.5 Programming Experience 
4.5.1 Syntactic Analysis 


Document Frequencies: In general, women use more non 
technical terms, and men use more technical terms. Exam- 
ples of terms that are more commonly used by male appli- 
cants include “game” (30.8% of men vs. 22.3% of women) 
and “develop” (21.5% of men vs. 14.4% of women), and 
terms more commonly used by female applicants include 
“mark” (39.9% of women vs. 30.6% of men) and “attend” 
(4.2% of women vs. 1.4% of men). Through manual inspec- 
tion, we discovered that “mark” was used in the context of 
earning a certain mark in a course. “attend” was used to 
indicate attendance in a programming workshop or event. 


Logistic Regression: As shown in Table the words 
used to describe programming experience can be used to 
predict the gender of the applicant. 


4.5.2 Programming Languages 

Table[20|shows a comparison of specific language knowledge 
between male and female applicants. All languages except 
for SQL are slightly skewed toward male applicants; how- 
ever, only Java, C++, C, Turing, C# have statistically 
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Table 19: F-Measures/Accuracies for predicting SE 
applicants’ gender using Programming Experience (in %) 


Table 21: F-Measures/Accuracies for predicting gender 
using Additional Information (in %) 


Group Unigram Bigram Trigram Group Unigram Bigram Trigram 
SE 91/88.8 98/98 97/95.7 BEE 60/58.4 52/51.3 53/50 
OTHER 78/77.3 81/77 93/89.2 
SE 86/83.7 93/86.3 93/95.2 


Table 20: Comparison of reported programming language 
knowledge 


Language % Women % Men _ Difference P-value 


Java 58.9% 65.6% -6.7% 0.00 

C++ 23.3% 28.5% -5.2% 0.01 
Python 25.1% 28.1% -3.0% 0.18 
HTML/CSS 19.0% 19.5% -0.5% 0.75 


Basic 16.1% 18.6% -2.5% 0.114 

C 12.5% 17.0% -4.5% 0.01 
JavaScript 12.7% 15.0% -2.3% 0.17 
10.8% 14.3% -3.5% 0.03 
CH 6.1% 9.4% -3.3% 0.01 
Php 3.9% 8.3% -4.4% 0.00 


SQL 3.9% 3.2% -0.7% 0.50 
Other 31.1% 16.4% -3.3% 0.07 
None 4.7% 3.2% +1.4% 0.07 


significant differences with a P-value of less than 0.05. In 
these cases, we only see differences ranging from 4% to 6%. 


Men on average report experience with 2.43 programming 
languages, whereas women report experience with 2.05 lan- 
guages, a significant result with a P-value of less than 0.05. 


4.5.3 Programming Concepts 

Among applicants who mentioned specific programming 
concepts, women reported Basic Language Knowledge, 
which includes loops, if-statements, and variables, 14% more 
than male applicants did. This result is significant with a 
P-value of less than 0.05. 


There are small differences in mentions of data science, 
object oriented programming, sorting, searching, and data 
structures. However, these results were not statistically sig- 
nificant, so we cannot conclude that there is a gender differ- 
ence in any mention of programming concepts. 


4.5.4 Learning Method 

We found that men were slightly more likely to learn how to 
program through employment or self-learning, and women 
were more likely to learn how to program in high school, 
through higher education, and through online courses. This 
result is not statistically significant with a P-value of greater 
than 0.05, so we cannot conclude that there is a gender dif- 
ference in how men and women learn how to program. 


4.5.5 Experience 

On average, women report 6 months of programming experi- 
ence, and men report 8 months of programming experience. 
This result is not significant with a P-value of greater than 
0.05, so we cannot conclude that there is a gender difference 
in the amount of experience within applicants to SE. 


4.6 Additional Information 


Document Frequencies: We see a difference in word 
choice between men and women when answering a question 
with no restrictions on the content of their answer. 


In BEE, more men mention “sport” (10.9% of men vs. 7.1% 
of women) and “compute” 4.7% of men vs. 2.3% of women). 
More women mention “educate” (17.2% of women vs. 12.2% 
of men), “science” (17.9% of women vs. 13.4% of men), 
“develop” (15.1% of women vs. 10.7% of men), “community” 
(14.8% of women vs. 10.8% of men), and “create” (8.5% of 
women vs. 5.0% of men). 


In SE, more men mention “compute” (27.8% of men vs. 
20.8% of women) and “game” (9.2% of men vs. 3.8% of 
women). More women mention “attend” (16.7% of women 
vs. 10.4% of men), “English” (12.8% of women vs. 7.2% of 
men), “study” (21.5% of women vs. 16.5% of men), “parent” 
(8.7% of women vs. 3.7% of men), “love” (14.2% of women 
vs. 10.1% of men), and “creative” (8.7% of women vs. 4.6% 
of men). 


In the OTHER programs, more men mention “sport” (10.2% 
of men vs. 5.7% of women) and “team” (16.3% of men 
vs. 12.4% of women). More women mention “art” (7.3% 
of women vs. 3.3% of men), “volunteer” (9.9% of women vs. 
6.4% of men), and “passion” (13.6% of women vs. 10.4% of 
men). 


Logistic Regression: The results for predicting gender 
based on Additional Information are shown in TableBI] As 
before, the predictive power of logistic regression decreases 
with increasing gender balance within the group. 


5. DISCUSSION 


5.1 Similarities 

Regardless of gender, the most commonly mentioned topic 
in responses to “Why are you interested in engineering?” 
is Technical Interests. Female and male applicants seem to 
share the same interest in Engineering in all program groups. 
SE applicants show more technical interest in engineering 
than other programs. 


In general, female and male applicants to SE mention the 
same motivation for studying engineering. Family is more 
popular among female applicants, not because female appli- 
cants to SE mention it more compared to other programs, 
but because male applicants talk about it less than men in 
other programs, as can be seen in Tables [15] and [14] 


In SE, we do not see a large gender gap in self reported pro- 
gramming experience, as shown in Table This suggests 
that students who are exposed to computer science do not 
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differentiate themselves through the number of languages 
they learn, nor in the amount of programming experience. 


In BEE, the differences between female and male applicants 
are minimal. We see evidence for this in the semantic anal- 
ysis presented in Section 4.1.2 where there is only one topic 
that shows a gender difference, and we observe this in our 
inability to reliably predict gender based on any question as 


shown in Tables and 


Based on Tables and Contribution to Society and 
Engineering Interests are inversely proportional, regardless 
of gender. 


5.2 Differences 
5.2.1 Depth vs. Breadth 


The overarching gender difference throughout the analysis 
is that men differentiate themselves through depth of ex- 
perience, and women through breadth of experience. To 
study engineering, all applicants must demonstrate knowl- 
edge in mathematics and sciences through their academic 
work. However, we see male applicants differentiating them- 
selves by highlighting their initiative to acquire more techni- 
cal skills through their work experience, extracurricular ac- 
tivities, reading interests, and the topics they mention when 
asked why they are studying engineering. Female applicants 
differentiate themselves through demonstrating a wide range 
of experiences and capabilities. This is suggested by the fact 
that women mention a wider variety of topics when asked 
why they are studying engineering, their extracurricular ac- 
tivities place an emphasis on leadership and artistic pursuits, 
they often take service jobs, and they choose to discuss more 
non technical reading material. 


In SE, men are more likely to report technical extracurricu- 
lars, as seen in Section 4.3, even though there is only a small 
gender difference in the reported amount of programming 
experience. This provides further justification that women 
differentiate themselves through breadth of experience even 
when they are extremely technically focused. 


The gender difference in depth versus breadth is much 
smaller in BEE. The difference in the number of topics men- 
tioned between men and women is the smallest across these 
two programs. We also only see a statistically significant 
difference between men and women in one topic, love of sci- 
ence, which is extremely common across all applicants. The 
small difference is consistent with our inability to predict 
gender in BEE. 


We also see this in the syntactic analysis of reasons, where 
women mention “improve” and “health” more in the BEE 
group, and “people” more in the SE group. It is an interest- 
ing difference because BEE includes programs that focus on 
helping others, and SE is often the farthest removed from 
directly working with people. 


5.2.2 Desire to Serve Society 

Women show a stronger desire to contribute to society and 
improve the world around them. We see this in their mo- 
tivation to study engineering in “Engineering Interests and 
Goals” in the OTHER group of programs where they are 


more likely to mention “Contribution to society”. We also 
see this in the syntactic analysis of this field where they men- 
tion “improve” and “health” in the BEE group, and “people” 
in the SE group. This is also evident in their work experience 
where women mention “assist” and “teacher” more often than 
men. Finally, we see this in extracurricular activities, where 
women mention “volunteer” more frequently than men. Our 
findings in this section agree with [4]. 


5.2.3 Influence 


Women are more likely to mention personal influences in 
their decision to study engineering. This is prevalent in 
SE, where women mention “Family” reasons more than men. 
This expands on the findings in [19]. 


6. CONCLUDING REMARKS 


The main findings of this paper are that men differentiate 
themselves through having technical depth in their experi- 
ences, and women differentiate themselves through having a 
breadth of experiences. We see similar behavior in Software 
Engineering, even though women and men show similar lev- 
els of technical know-how. We see smaller gender differences 
in applicants to Biomedical and Environmental Engineering 
where there is gender equity. Finally, women mention more 
of a desire to serve society, and they mention more interper- 
sonal reasons for studying engineering than men. 


We infer that in order to attract more women to study en- 
gineering, it must be presented as a profession that can help 
others and allow for a broad range of careers and learning 
opportunities. A key part in fostering this new image of en- 
gineering lies in encouragement from family and role models 
who practice engineering. 


6.1 Future Work 


In future work, we intend to conduct data driven analysis 
of gender differences at various stages in STEM students’ 
academic careers; e.g., investigating the effects of university- 
sponsored outreach and mentorship programs on applicants, 
and correlating depth and breadth of expression at the time 
of admission to academic and career success. We also plan 
to investigate and compare gender differences in graduate 
school applications to those in undergraduate applications. 
We also want to expand the scope of our studies to include 
non STEM programs in our analysis, and conduct compar- 
isons of differences in STEM vs. non-STEM programs. 
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